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Abstract 



Let E = (V, E) be a finite, d-regular bipartite graph. For any A > let 
tt\ be the probability measure on the independent sets of E in which the set 
/ is chosen with probability proportional to A^l (ir\ is the hard-core measure 
with activity A on E). We study the Glauber dynamics, or single-site update 
Markov chain, whose stationary distribution is ir\. We show that when A is 
large enough (as a function of d and the expansion of subsets of single-parity 
I/-) . of V) then the convergence to stationarity is exponentially slow in |V(E)|. In 

\ particular, if E is the d-dimensional hypercube {0, l} d we show that for values 

of A tending to as d grows, the convergence to stationarity is exponentially 
slow in the volume of the cube. The proof combines a conductance argument 
^2 ', with combinatorial enumeration methods. 

(N , 

1 Introduction and statement of the result 

Let E = (V, E) be a simple, loopless, finite graph on vertex set V and edge set E. 
(For graph theory basics, see e.g. [3], [7].) Write X(E) for the set of independent sets 
(sets of vertices spanning no edges) in V. For A > we define the hard-core measure 
with activity A on X(E) by 

nxi{I}) = Y7z) {oTleZ W 
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where -Za(E) = Yliei ^ 1S ^ ne appropriate normalizing constant. We will often write 
w x (I) for A |r| and, for JCI, iy A (,7) for £ JeJ u> A (J). 

The hard-core measure originally arose in statistical physics (see e.g. [HIT]) where 
it serves as a simple mathematical model of a gas with particles of non-negligible size. 
The vertices of £ we think of as sites that may or may not be occupied by particles; 
the rule of occupation is that adjacent sites may not be simultaneously occupied. The 
activity parameter A measures the likelihood of a site being occupied. 

The measure also has a natural interpretation in the context of communications 
networks (see e.g. [14J ) . Here the vertices of £ are thought of as locations from 
which "calls" can be made; when a call is made, the call location is connected to 
all its neighbours, and throughout its duration, no call may be placed from any of 
the neighbours. Thus at any given time, the collection of locations from which calls 
are being made is exactly an independent set in the graph. If calls are attempted 
independently at each vertex as a Poisson process of rate A and have independent 
exponential mean 1 lengths, it can be shown that the long-run stationary distribution 
of this process is the hard-core measure on E. 

Our particular focus in this paper is the mixing time of the Glauber dynamics, or 
single-site update Markov chain, for this model. The measure it\ can be realized as 
the stationary distribution of a certain Markov chain. Specifically, consider the chain 
M.\ = M.\(Y1) on state space X(E) with transition probabilities P\(I, J), I, J G ^(£), 
given by 



Px(I, J) 



if|JAJ|>l 

jlTf TTa if |J A J| = 1, I C J 

^ if|IAJ| = l, JCI 



Underpinning the definition of M.\ is the following dynamical process, known as the 
Glauber dynamics on X(S). From an independent set /, the process follows three 
steps. The first step is to choose a vertex v uniformly from V. The second step is to 
"add" v to I with probability A/(l + A), and "remove" it with probability 1/ (1 + A); 
that is, to set 

j, ( I U {v} with probability 

\ I\{v} with probability y+\~ 

The third step is to move to I' if it is a valid independent set, and stay at / otherwise. 

It is readily checked that A4\ is an ergodic, aperiodic, time reversible Markov 
chain with (unique) stationary distribution ttx- A natural question to ask about Ai\ 
is how quickly it converges to its stationary distribution. To make this question 
precise, we need a few definitions. 

Let Ai be an ergodic Markov chain on state space Q, with transition probabilities 
P : Q 2 — )• [0, 1]. For a state uo G f2, denote by P t (uo, •) the distribution of the state at 
time t, given that the initial state is cjq, and denote by n the stationary distribution. 
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Define the mixing time of Ai by 



t m = max min < t : —} \P t (oJo, to) — n(u))\ < - \/t>t >. 
ui en 2 e 

^ wen J 

The mixing time of Ai captures the speed at which the chain converges to its station- 
ary distribution: for every e > 0, in order to get a sample from Q which is within e 
of 7r (in variation distance), it is necessary and sufficient to run the chain from some 
arbitrarily chosen distribution for some multiple (depending on e) of the mixing time. 

Much work has been done on the question of bounding r^ A . The strongest general 
result available to date is due to Vigoda [21] who showed that if £ is any n-vertex 
graph with maximum degree A, then r_A4 A (E) = O (n log n) whenever A < 2/ (A— 2). In 
the other direction, Dyer, Frieze and Jerrum [H] considered the case A = 1 and showed 
that for each A > 6 a random (uniform) A-regular, n- vertex bipartite £ almost surely 
(with probability tending to 1 as n tends to infinity) satisfies tm x (^) > 2 7 ™ for some 
absolute constant 7 > 0. 

Here, we continue in the spirit of [9] and construct explicit families of graphs for 
which Glauber dynamics mixes slowly. Specifically, we establish a certain expansion 
condition in a regular bipartite graph £ that forces t.m a (e) to be (almost) exponential 
in \ V\ provided A is suitably large (as a function of the expansion). The d- dimensional 
hypercube {0, l} d satisfies this condition for A > u{dr l / i log 3//2 d). 

Our work is partly motivated by [5] where a study was made of Glauber dynamics 
for the hard-core measure on the even discrete torus T^. This is the graph on 
{0, . . . , L — l} d (with L even) in which two strings are adjacent if they differ on only 
one coordinate, and differ by 1 (mod L) on that coordinate. It was shown in [5] that 
for A growing exponentially in d (with a suitably large base), t~m x (t l d ) is exponential 
111 cL d - 1 / log 2 L for some c that depends on d but not on L. 

In light of a recent result of Galvin and Kahn [T2], we found it tempting to believe 
that slow mixing on Tl^ should hold for much smaller values of A; even for some 
values of A tending to as d grows. The main result of [12] is that the hard-core 
model on Z d exhibits multiple Gibbs phases for A > Cd~ 1 ^ log 3//4 d for some large 
constant C. Specifically, write £ and O for the sets of even and odd vertices of Z d 
(defined in the obvious way: a vertex of Z d is even if the sum of its coordinates is 
even). Set 

A M = A d M = [-L, L} d , dA M = [-L, L] d \ [—(L — i),L — l] d . 

For A > 0, choose I from X(Am) with Pr(I = I) oc A' 7 '. The main result of [12] is 
that there is a constant C such that if A > Cd^ 1 ^ 4 log 3//4 d then 

lim Pr(0 G I|I D dA M n 8) > lim Pr(0 E I|I D dA M n O). 

Af— >oo M— >oo 
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Thus, roughly speaking, the influence of the boundary on behavior at the origin per- 
sists as the boundary recedes. Informally, this suggests that for A in this range, the 
typical independent set chosen according to the hard-core measure is either predom- 
inantly odd or predominantly even. Thus there is a highly unlikely "bottleneck" set 
of balanced independent sets separating the predominantly odd sets from the pre- 
dominantly even ones. It is the existence of this bottleneck that should cause the 
conductance of the Glauber dynamics chain to be small (see Section |2J), and thus 
cause its mixing time to be large. 

Our main result (Theorem II. ip provides some support for this belief, verifying it 
in the case L = 2; unfortunately, because of the weak isoperimetry of the torus we 
cannot hope to use Theorem II. II to deal with general L. (See Remark 11.61 for further 
discussion of these issues.) 

Before stating Theorem II .1| we establish some notation. From now on, S = (V, E) 
will be a d- regular, bipartite graph with partition classes S and O. Set A/" = |V| and 
M = \S\ = \0\ (= N/2). 

For u, v G V we write u ~ v if there is an edge in £ joining u and v. Set 
N(u) = {w G V : w ~ u} (N(u) is the neighbourhood of u) and N(A) = U we AN(w). 
For ACS (or O) set 



we think of [A] as an "external closure" of A. Note that while A determines N(A), 
N(A) determines only [A]. For this reason, we find it more convenient at some points 
in the sequel to deal with [A] rather than with A itself. Say that A is small if 
\[A]\ < M/2. Define the bipartite expansion constant of E by 



Note that < 5 < 1. (The second inequality is obvious. The first follows from 
regularity, which implies that S has a perfect matching, which in turn implies that 
for all ACS (or O), \A\ < \N(A)\.) 

All implied constants in O and Q notation are independent of d. We use "log" 
throughout for log 2 and "In" for log e . We write exp 2 x for 2 X . We always assume that 
d is sufficiently large to support our assertions. 



[A] = {xe V(E) : N(x) C N(A)}; 



5(E) = min 



\N(A)\-\[A]\ 
\N(A)\ 



ACS small or A C O small, A ^ 



Set 




log(l + A) 




and 



/3(A) 



log 2 (l + A) 



log(l + A)+log(d 5 /£)' 
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We note for future reference that 



A > _L impte L> aW = n (^L_) . (3 ) 

Our main result is 

Theorem 1.1 Lei be a d-regular, bipartite graph with N > d 2 vertices and bipartite 
expansion constant 5. There is a constant c > such that whenever A and 5 satisfy 

/?(A)>cmaxj— j (4) 

we have 

tm a( e) >exp 2 {fi(iVa(A)/3(A)5)}. 

Remark 1.2 If we add as an additional hypothesis to Theorem }! . l\ that £ has bounded 
codegree (that is, there is a constant k independent of d such that each pair of vertices 
in S has at most k common neighbours) , then we can slightly improve our bound on 
A to 

f io g (rf 5 A) log eg 

/3(A)>cmax|— |. (5) 

We do not present the more complicated argument here. 

Note that since 5 < 1, we cannot possibly satisfy fll]) for A < l/y/d, so we may 
(and will) assume from here on that A > 1/ \fd. 

A slightly stronger condition that implies fl4]) is 

/ iog(rf 5 A) logtyiog(^) 

log(l + A) >c max<{ rfl/4 , [> (6) 



where the constant d depends on c, from which we can more clearly see the tradeoff 
between A and 5. From ([6]) we may also read off the following corollary of Theorem 
11.11 addressing Glauber dynamics for sampling a uniform independent set (A = 1). 



Corollary 1.3 Let S satisfy the conditions of Theorem There is a constant 

c > such that whenever 5 > clog 3 d/d we have 

N5 



TMiQS) > ex P2 { ^ 



log d 
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As an application of Theorem II .![ we consider the case £ = Qd, the c?-dimensional 
hypercube. This is the <i-regular bipartite graph on vertex set {0, l} d in which two 
vertices are adjacent if they differ on exactly one coordinate. The hypercube satisfies 
5(Qd) > £1(1 /yfd) (see, e.g. [TBI Lemma 1.3]; this bound can also be derived from 
isoperimetric inequalities of Bezrukov [2] and Korner and Wei [15]) and so if d > 
is a suitably large constant (depending on the constant c provided by Theorem 11.11) 
then (jlj) is satisfied as long as A > c'd~ 1//4 log 3 / 2 d. So the following is a corollary of 
Theorem 11.11 



Corollary 1.4 There are constants c, d > such that whenever A > cd 1 / 4 log 3 ^ 2 d 
we have 



c'2 rf log 3 (l + A) 



r Mx (Q d ) > exp 2 



In particular, 



Vd(l + log(l + A)) (eg log d + log(l + A)) log (2 + 



T~M x {Q d ) > < 



2 d log 3 (l+A) 
eXp 2i v^log 2 ^ 



exp 2 



2"log(l+A) 
eXp 2 1 yft 




} if cd- l /Hog z,2 d < A < 0(1), 



if tt{l) < A < 0(d), 



if tt(d) < A. 



Remark 1.5 Using $5\) in place of 0]) (which we may do, since Qd has bounded 
codegree) we may improve the bound on A in Corollary \l-4\ to A > cd~ l ^\ogd. 



Remark 1.6 Let us return to T^, the even discrete torus. Since Qd is easily seen 
to be isomorphic to T 2 (i; Corollary \1.4\ gives an exponential lower bound on TM x {T 2 d) 
for sufficiently large d whenever A > w(<i~ 1//4 log 3//2 <i). Unfortunately, the best bound 
we can obtain on the bipartite expansion constant of Ti,d is S(Tl^) > £1(1/ Ld) (see, 
e.g fTB/ ). so we cannot use Theorem \l.l\ to obtain any lower bound on A independent 
of L beyond which t Mx (t l d ) is large for all even L > 4 and sufficiently large d. 
However, subsequent to the completion of this paper, a strategy specific to the torus 
has been employed in FIT) / to show that for all even L > A, A > u(d~ 1 ^ 4 log 3 / 4 d) and 
d sufficiently large, 



2 Proof of Theorem 1.1 



The notion of conductance, introduced in [13], can be used to analyze the behavior 
of t_a4 a . Let M. be a Markov chain on state space £l with transition matrix P and 
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stationary distribution it. For Ui,u>2 € Q and A, B C set 

w 2 ) = 7r(wi)P(a;i, w 2 ) and Q(A,.B) = <3(wi,w 2 ) 

For 7^ S 1 C f2, define the conductance of 5 as 

Q(3,n\S) 



7t(S) 



We may interpret as the probability under 7r that the chain escapes from S in 

one step, given that it is in S. Define the conductance of M. as 

<&M = min <&(S). 

0<7r(S)<± 

We may then bound the mixing time of Ai by 

(see e.g. [9], where the above bound is derived without assuming time- reversibility of 
the chain M). Thus to show that the mixing time is large, it is enough to exhibit a 
single S with small conductance. 

Throughout this section we fix £ satisfying the conditions of Theorem 11.11 Set 

i £ = {/gi(s) : \ms\ > \mo\}, 

define Iq analogously, and set X& = X(£) \ (Is U X©) (X& is the set of balanced inde- 
pendent sets). Without loss of generality, assume n\{Is) < 1/2- Because Glauber 
dynamics changes the size of an independent set by at most one at each step, we have 
that if / G Ig, J ^ Is satisfy P\(I, J) 7^ 0, then J e I b . It follows that 

Q(Is,n\I s ) = *x(I)Px(I,J) 

= J2 *x{J)Px{J,I) (8) 

IG1 £ ,J£1 E 

= Yl *x(J)Px(JJ) 
i&x £ ,Jex b 

< vr A (X 6 ). 

The simplest way to see (jHJ) is to use the fact that M.\ is time- reversible (that is, 
that ir\(I)P\(I, J) = tt\(J)P\(J, I) for all I, J G X); but note that more generally if 
M. is a (not necessarily time-reversible) Markov chain on finite state space Q with 
transition matrix P and stationary distribution tt then 

7r(u;i)P(u;i,u;2) = ^2)^(^2,^1) 



for all SC.fl. Now using the trivial lower bound W\(Xg) > (1 + A) (recall that for 
J CX, u x (J) = Ej e j A |J| ) we obtain 



$m < ms) < = WxiXb) < Wx{Ib> o) 



Thus (recalling (JTJ)) to show that tm x is large, it is enough to show that w\(Ib) is 
small. We may think of Xb as a "bottleneck" set through which any run of the chain 
must pass in order to mix; if the bottleneck has low measure, the mixing time is high. 
We will actually consider a larger "bottleneck" set. Set 

X tnv = {IeX : \ins\,\mo\<a{\)M} 

and 

X nt = {IeX : min{|/n£|,|ln0|} > a(A)M}, 

where a(A) is as defined in (J2J). Note that Z& C X tTtv Ul nt . We will show that as long 
as A satisfies @, 

w x (X tnv U X nt ) < (1 + \) M exp 2 {-Q (Ma(\)p(\)6)} , (10) 

from which Theorem 11.11 follows via (J^J) and (J9j) . 

Dealing with w\(X triv ) is relatively straightforward. We begin by observing that 

MA) log 4- < , ^P^L - (11) 
v ; S a(A) " 2 (1 + log(l + A)) v ; 

To see this, first set 

7 (A) = lQg(1 + A) . 
n ; l + log(l + A) 

Note that for all A > 0, < 7(A) < 1. We have a(A) = 7(A)/(441og(l + 1/t(A))) and 
so (TlTl) is equivalent to 

'441og(l + 



-(A) pgrg ^7(A) 



lllo H 1 + *) 1 7(A) 



which is in turn equivalent to 



44 log 1 + —— < 7 (A) 1 + 



I \ 11/2 



7(A); - 7 (A) 

That this inequality holds for all < 7(A) < 1 is a routine calculus exercise. Note 
also that for < x < 1/e, 

x < H(x) < 2x\og- (12) 
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(where recall H(x) = — xlogx — (1— x) log(l— x) is the usual binary entropy function). 
Finally, we use a result concerning the sums of binomial coefficients which follows from 
the Chernoff bounds [6j (see also [3], p. 11): 



m , N 



i=0 



J2( )< 2*® N for c < I (13) 



where [x] denotes the integer part of x. 

Now with the inequalities justified below, we have 



M ' 2 



< exp 2 {2F(ct(A))M + 2a(A)Mlog(l + A)} (14) 

< exp 2 {2M#(a(A))(l + log(l + A))} (15) 

< exp 2 |4Ma(A)log^y(l + log(l + A))| (16) 

< expJM^U (17) 



< (1 + A)t. (18) 

Here (and throughout) we use (<J for J2 i<k (")• In (JHJ), we are using (fl3~|) . which is 
applicable by (j3J). In ( !T5|) we are using the first inequality in ( fl2|) and in (fi~6]) we are 
using the second (again, both of these are applicable by fl3]).) Finally in (flTl) we are 
using ( TTTTl . 

Bounding w x (I nt ) requires much more work. We begin by enlarging X nt slightly. 
Say that I e X(£) is small on E if | [I D £] | < M/2 (recall that for 4a, [A] ={dG 
C : JV(v) C A}), and set 

Xf = {I E l nt : 7 is small on £}. 

Define small on O and Xg* similarly. A simple argument, based on the fact that S 
has a perfect matching, shows that any I G X(S) must be small on at least one of £, 
O, and so we have 

w x {l nt ) < 2max{ WA (Xf ), WA (Xg')} ■ 
We may assume without loss of generality that 

w x {Xf) = max {w x (lf), w x (I$)} 

so that it is enough to show that 

w x (lf) < (l + A) M exp 2 {-fi(Ma(A)/3(A)5)}. 
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For each a > a(X)M and g > a set 

A(a,g) = {AQ8:\[A]\ = a,\N{A)\=g} 



and set 
We have 



Z(a,g) = {IeZ?:In£eA(a,g)}. 



a>a(\)M, g>a 

< MA(a,g))(l + \) M - 9 

a>a(X)M, g>a 

< (i + A) M MAa,g))(i + \y 9 

a>a(\)M, g>a 

< (1 + A) M M 2 max w x (A(a, g)){l + X)' 9 

a>a(X)M, g>a 

The key now is to upper bound w\(A(a, g)). The following theorem (whose proof 
is given in Section |3]) is based on ideas of A. Sapozhenko [HI IT9] . 



Theorem 2.1 Let S be any graph satisfying the assumptions of Theorem We 
have 

w x (A{a, g)) < (1 + A) 9 exp 2 {-Q {(g - a)/3(A))} . 
for any a > a(X)M and any A satisfying (fj]j. 

For a > a(X)M and g > a we have g — a > 5g > Ma(X)5 and so 

wxile*) < (1 + X) M M 2 exp 2 {-Q{Ma{X)(3{X)5)} 

< (1 + X) M exp 2 {-Q{Ma{X)f3{X)5)}. (19) 

To see that the factor of M 2 may be absorbed into the exponent, note that by hy- 
pothesis, 2M > d 2 and so M 2 < exp 2 {O (M log d/d 2 )}, and that combining ([3]) and 
gD we have a(X)(3(X)5 > {l(d~ 3 / 2 hgd). 

Combining ( !T9|) and ( TT8|) we get ( TTUj) and hence Theorem 11.11 



3 Proof of Theorem EE 

For u, v G V and A, B C V we write V(A) for the set of edges having one end in A 
and (if A fl B = 0) V(A, 5) for the set of edges having one end in each of A, B. We 
also write (Ia{v) for |iV(t>) fl A\. 
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Throughout this section, we fix £ satisfying the assumptions of Theorem 11.11 We 
also fix a and g, but we do not assume a > a(X)M. We write A for A(a,g). Given 
AeAwe always write G for N(A) and set t — g — a. Note that for A G A, 

\V(G,S\[A])\=dg-da = td. (20) 

The proof of Theorem 12.11 involves the idea of "approximation" . We begin with 
an informal outline. To bound W\(A), we produce a small set U with the properties 
that each A G A is "approximated" (in an appropriate sense) by some U 6W, and for 
each U 6W, the total weight of those A G A that could possibly be "approximated" 
by U is small. (Each U G U will consist of two parts; one each approximating G 
and A.) The product of the bound on \U\ and the bound on the weight of those 
A G A that may be approximated by any U is then a bound on w\(A). The set U is 
itself produced by an approximation process — we first produce a small set V with 
the property that each A G A is "weakly approximated" (in an appropriate sense) 
by some V G V, and then show that for each V there is a small set W(V A ) with 
the property that for each A G A that is "weakly approximated" by V, there is a 
W G W(V r ) which approximates A; we then take W = Uy eV W(l / ). (Each V G V will 
consist of a single part.) 

The main inspiration for the proof of Theorem 12.11 is the work of A. Sapozhenko, 
who, in [19], gave a relatively simple derivation for the asymptotics of the number 
of independent sets in Q d (in the notation of (JTJ, this is the asymptotics of Z\{Q d ) 
with A = 1), earlier derived in a more involved way in [16J. Our Lemma 13.31 is a 
modification of a lemma in [18], and our overall approach is similar to [19]. See e.g. 
[TO] for another recent application of these ideas. 

We now begin the formal discussion of Theorem 12. II by introducing the two notions 
of approximation that we will use, beginning with the weaker notion. A covering 
approximation for A C £ is a set Fq G 2 satisfying 

F CG, N(F )D[A]. 

The second notion of approximation depends on a parameter 1 < ip < d/2. A 
%p- approximation for A C £ is a pair (F, S 1 ) G 2° x 2 f satisfying 

FCG, SD[4 (21) 

dp.(it) > d - ip Vu G 5 (22) 

and 

d £ \s(u)>d--0 WG0\F. (23) 

Note that if x G [A] then JV(x) C G, and if y G C \ G then iV(y) a\[A], If we 
think of S as "approximate [A] " and F as "approximate G" , (|22j) says that if x G £ is 
in "approximate [A]" then almost all of its neighbours are in "approximate G", while 
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says that if y G O is not in "approximate G" then almost all of its neighbours 
are not in "approximate [A]" . 

Before continuing, we note a property of -^-approximations that will be of use 
later. 

Lemma 3.1 If (F, S) is a if>- approximation for A G A then 

\S\<\F\ + ^-. (24) 
a — if) 

Proof: Observe that \W(S,G)\ is bounded above by d\F\ + if)\G \ F\ and below by 
d\[A]\ + (d-if))\S\ [A}\ =d\S\-iP\S\ [A}\, giving 

\S\<\ F \ | tP\(G\F)U(S\[A])\ ^ 
~ d 

and that each u G (G \ F) U (S \ [A]) contributes at least d — if) edges to V(G, £ \ [A]), 
a set of size td, giving 

2td 

\(G\F)U(S\[A])\<j^-. 

These two observations together give §M$- 1=1 

There are three parts to the proof of Theorem 12.11 Lemma 13.21 is the first "ap- 
proximation" step, producing a small family V of covering approximations for A. 
Lemma 13.31 is the second "approximation" step, refining the covering approxima- 
tions to produce a family W of ^-approximations for A. Finally, Lemma 13.41 is the 
"reconstruction" step, bounding the weight of the set of A's that could possibly be 
'(/'-approximated by a member of W. We now state the three relevant lemmas. We 
will then derive Theorem 12.11 before turning to the proofs of the approximation and 
reconstruction lemmas. 

Lemma 3.2 There is aV = V(a,g) C 2° with 

M 



V < 



< 2 9 lo S d 
— d 



such that each A & A has a covering approximation in V. 

Lemma 3.3 For any Fq G V and 1 < if) < d/2 there is a W = W(Fq, if), a, g) C 
2° x 2 s with 

lyyi < f 2 9^ogd\ {2d 3 g\ogd\ ( 2g\ogd 



< 2£ M < 2t H < td 

— d y \ — ip /v— (d-tp)ii> 

such that any A G A for which F is a covering approximation has a if) -approximation 
in W. 
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Lemma 3.4 Given 1 < ^ < d/2 and 1 > 7 > ^fe /or eac/j (F, 5) G 2° x 2 £ tfia£ 
satisfies we /iai>e 




where the sum is over all those A's in A satisfying FCG and S D [A] . 



Before turning to the proofs of Lemmas 13.21 13.31 and 13.41 we use them to obtain 
Theorem 12. 11 Throughout, we will use (usually without comment) a simple observa- 
tion about sums of binomial coefficients: if k = o(n), we have 



E(I) < (i +0 w.»(3 



i<k 

< {l + 0{k/n)){en/k) k 

< exp 2 {(l + o(l))Jfelog(n/Jfe)}. (26) 

Take ip = yd and 

log(l + A)-^log(d 5 /5) 



7 = 

iug^± -|- A) -|- log^u 
Note that for this choice of ^ and 7 we have 7 > -^7 , and so 



log(l + A)+log(e/ 5 /5) 
3c/ 1 



log .J" r <;iog(^. 



The bound in fTSol) is therefore at most 

(l + A) 9 exp 2 {max{-7tlog(l + A), t (7^7 + 7 ) log(c/ 5 /^) 



d-ip 

(Here we have used (|26p ). For our choice of ip and 7 this is at most 

f '°g(i + A)^aiog(^)-iog 2 (i + A) ygiog(dVg 

(1 + A) exP2 y iog(i + A )+ iog(^) + 1 d-^d 

which in turn is at most 

\ \ Vd J log(l + A)+log(e/ 5 /5)J 



(27) 
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The bounds in Lemmata 13.31 and 13.21 are at most 

log 2 d 
T 



exp 2 jO I + t — -j= — 1 > and exp 2 j \ g- 



(2f 



respectively. For the latter bound, we are using the assumption a > a(M) of Theorem 
and the fact that A > l/\fd, which together imply (via (J3])) that 



glogd 

Combining 022]) with (J^TJ) , we get 



2 ; i....,.m /.^N l0g 2 (l + A) 



, w , f / log 2 d log(d 5 /<5)\ 
^a(^) < (1 + A) 9 exp 2 jO + * ^ j " ' 



log(l + A) + log(d 5 /5) 



Noting that t > Sg always, we find that if A satisfies (jl]) with a suitably large constant 
c, then 

log 2 (l + A) 



^)<( 1 + AN Pl |-^ t - g(1+A)+logTO 

and so we get Theorem 12.11 

We now turn to the proofs of Lemmata 13.21 13.31 and 13.41 

Proof of Lemma Iff.ffl We appeal to a special case of a fundamental result due to 
Lovasz [T7] and Stein [20] • For a bipartite graph F with bipartition P U Q, we say 
that Q' C Q covers P if each p e P has a neighbour in Q'. 

Lemma 3.5 7/T as above satisfies \N(x)\ > p /or eac/i x E P and \N(y)\ < q for 
each y E Q, then P is covered by some Q' C Q mt/i 

IQ'I < (|Q|/p)(l + lng). 

Applying the lemma with T the subgraph of E induced by [A]UG, P = [A], Q = G 
and p = q = d, we find that each A e A has a covering approximation of size at most 
2g log d/d. Taking V to be the set of all subsets of O of size at most 2g log d/d, the 
lemma follows. □ 

Proof of Lemma \3.3l We describe an algorithm, which we refer to as the degree 
algorithm, which produces for input (F ,A) G 2° x 2 s for which F is a covering 
approximation of A (i.e., with N(F ) D [A]), an output {F, S) £ 2° x 2 £ which is 
a -^-approximation for A (i.e, which satisfies (I2ip . (122]) and ( 123]) ). The idea for the 
algorithm is from [18]. To begin, fix a linear ordering of V. 

Step 1: If {w G [A] : dc\F ( u ) > d/2} ^ 0, pick the smallest (with respect to <C) u in 
this set and update F by F < — F U N(u). Repeat this until {u G [A] : dc\F ( u ) > 
d/2} = 0. Then set F 1 = F and S 1 = {u e £ : d Fl (u) > d - d/2} and go to Step 2. 
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Step 2: If {v G 0\G : ds x (v) > ip} ^ 0, pick the smallest (with respect to <C) v in this 
set and update Si by Si < — Si\N(v). Repeat this until {v G 0\G : ds x (v) > = 0. 
Then set S 2 = Si and F 2 = {v G O : ds 2 (v) > ip} and go to Step 3. 

Step 3: If {w G [A] : dG\F 2 ( w ) > 4>} pi°k the smallest (with respect to <C) w in 
this set and update F 2 by F 2 < — F 2 UN(w). Repeat this until {w G [A] : dG\F 2 ( w ) > 
ip} = 0. Then set F = F 2 and S = S 2 P\ {w G £ : d F {w) > d — ip} and stop. 

Claim 3.6 TTie output of the degree algorithm is a %p- approximation for A. 

Proof: To see that F C G and S 1 D [A], first observe that S\ D [A] (or Step 1 would 
not have terminated). We then have S 2 D [A] (since Step 2 deletes from Si only 
neighbours of 0\G), and F 2 C G (or Step 2 would not have terminated). Finally, 
F C G (since the vertices added to F 2 in Step 3 are all in G) and S ^ [A] (or Step 3 
would not have terminated) 

By the definition of S, ( l2~2"j) is satisfied. To verify ( l2"3"j) . note that by definition of 
F 2 , iiy E 0\F 2 then ds\s 2 {y) > d — ip. That y <E 0\F implies ds\s(y) > d — ip now 
follows from the fact that F 2 C F and S 2 ^) S. □ 

Remark 3.7 T/ie a/erf reader may have noticed that if we replace d/2 by ip in Step 
1, then the output of Step 2 is already a ip- approximation for A. The three-step 
algorithm, however, is needed to obtain the right bound on /3(A) in Theorem \l.l\ see 
Remark HOI following the proof of Claim \37R 

Claim 3.8 Fix Fq G V. The degree algorithm has at most 



outputs as the input runs over those (Fq, A) for which A G A and Fq is a covering 
approximation for A. 

Taking W to be the set of all possible outputs of the algorithm, the lemma follows. 

Proof of Claim [378j - The output of the algorithm is determined by the set of li's whose 
neighbourhoods are added to F in Step 1, the set of v 's whose neighbourhoods are 
removed from Si in Step 2, and the set of u>'s whose neighbourhoods are added to F 2 
in Step 3. 

Each iteration in Step 1 removes at least d/2 vertices from G \ F, a set of size at 
most g, so there are at most 2g/d iterations. The w's in Step 1 are all drawn from 
[A] and hence N(Fq), a set of size at most d\Fo\ < 2g\ogd. So the total number of 
outputs for Step 1 is at most 





(29) 
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At the start of Step 2, each x G S\ \ [A] contributes at least d/2 edges to V(G, 8 \ 
[A]), by (12"0|) a set of size dt, so |Si \ [A] \ < 2t. Each v used in Step 2 reduces this by 
at least ip, so there are at most 2t/ip iterations. Each v is drawn from N(Si), a set 
which is contained in the fourth neighbourhood of Fq (Si C N(G) by construction of 
St, G = N(A) and A C iV(F )) and so has size at most d 4 \F \ < 2d 3 g\ogd. So the 
total number of outputs for Step 2 is 



2d 3 g log d 

< 2t 



(30) 



At the start of Step 3, each y G G \ F 2 contributes at least d — if) edges to 
V(G,£ \ [A]), so \G \ F 2 \ < dt/(d — ip). Each w used in step 3 reduces this by at 
least ip, so there are at most dt/((d — ip)ip) iterations. As in Step 1, the w's are all 
drawn from a set of size at most 2g log d, so the total number of outputs for Step 1 is 
at most 

^31)- (3D 

— (d-i/>)ip/ 

Combining (125]), (1501) and (I3TT). the claim follows. □ 



Remark 3.9 The bound in Claim lST^ is at most exp 2 {0 (g log d/d + t\og(d 5 /5)/ip)}. 
If we replace d/2 by ip in Step 1 of the degree algorithm and take the output of Step 
2 to be the final output, then the bound in the claim becomes weaker: 

V — ij, / V — (d-ip)ip ' 

(Each iteration of Step 1 now reduces G\F by at least ip). Using this bound in 
the proof of Theorem li.il instead of the stronger bound given by the three-step degree 
algorithm would ultimately lead to a weaker bound on /3(A) in (0). Step 1 of the degree 
algorithm may be though of as an "initialization" which reduces \G\F\ from 0(g) to 
0(t) without adding much to the "cost" of the algorithm. 



Proof of Lemma \3.4\ Say that S is small it\S\ < g — yt and large otherwise. We can 
obtain all A G A for which F C G and S D [A] as follows. 

If 5* is small, we specify of A by picking a subset of S. If S is large, we first specify 
G. Note that by and the definition of large we have in this case that 

\G\F\<2tip/(d-il))+jt and G\FCN(S)\F, 

so we specify G by picking a subset of N(S) \ F of size at most 2tip/ (d — ip)+ jt (this 
is our choice of G \ F). Then, noting that [A] is determined by G, we specify A by 
picking a subset of [A]. 
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This procedure produces all possible A'b (and more). We now bound the sum of 
the weights of the outputs. 

If S is small then the total weight of outputs is at most 

(1 + A) 9-7 '. (32) 

We have 

\N(S)\F\<d\S\<dg + ^<3dg 

so that if S is large, the total number of possibilities for \G \ F\ is at most 

3 dg 

2tip 
d-i> 

and the total weight of outputs is at most 

3dg 

Combining (132]) and ( 133|) . the lemma follows. □ 
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